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Abstract 

We study various SDP formulations for Vertex Cover by adding different constraints to 
the standard formulation. We show that Vertex Cover cannot be approximated better than 
2 — o(l) even when we add the so called pentagonal inequality constraints to the standard 
SDP formulation, en route answering an open question of Karakostas 14 . We further show 
the surprising fact that by strengthening the SDP with the (intractable) requirement that the 
metric interpretation of the solution is an l\ metric, we get an exact relaxation (integrality gap 
is 1), and on the other hand if the solution is arbitrarily close to being £1 embeddable, the 
integrality gap may be as big as 2 — o(l). Finally, inspired by the above findings, we use ideas 
from the integrality gap construction of Charikar to provide a family of simple examples for 
negative type metrics that cannot be embedded into l\ with distortion better than 8/7 — e. To 
this end we prove a new isoperimetric inequality for the hypercube. 



1 Introduction 



A vertex cover in a graph G = (V, E) is a set S C V such that every edge e £ E intersects 
S 1 in at least one endpoint. Denote by vc(G) the size of the minimum vertex cover of G. It is 
well-known that the minimum vertex cover problem has a 2-approximation algorithm, and it is 
widely believed that for every constant e > 0, there is no (2 — e)-approximation algorithm for this 
problem. Currently the best known hardness result for this problem shows that 1.36-approximation 
is NP-hard |lUj . If we were to assume the Unique Games Conjecture [T^], the problem would be 
essentially settled as 2 — would then be NP-hard |16j . 

In a seminal paper, Goemans and Williamson ^2] introduced semidefinite programming as a 
tool for obtaining approximation algorithms. Since then semidefinite programming has been applied 
to various approximation problems and has become an important technique, and indeed the best 
known approximation algorithms for many problems are obtained by solving an SDP relaxation of 
them. 

The best known algorithms for Vertex Cover compete in "how big is the little o" in the 
2 — o(l) factor. The best two are in fact based on SDP relaxations: Halperin ^3] gives a (2 — 
log log A/ log A)-approximation where A is the maximal degree of the graph while Karakostas 
obtains a (2 — Q(l /\/log n))-approximation |14j . 

The standard way to formulate the Vertex Cover problem as a quadratic integer program is 
the following: 

Min £iev(l + TO)/2 
s.t. (xi — xo)(xj — xq) = V ij £ E 

Si €{-1,1} Vie{o}uy, 

where the set of the vertices i for which xi = xq correspond to the vertex cover. By relaxing this 
integer program to a semidefinite program, the scalar variable xi now becomes a vector Vj and we 
get: 

Min E ie y(l + v v 4 )/2 

s.t. (vi - v ) • (vj - v ) = V ij G E (1) 

IH| = l Vie {0} u v. 

Kleinberg and Goemans 18] proved that SDP (^Q) has integrality gap of 2 — o(l). Specifically, 
given e > 0, they construct a graph G e for which vc(G e ) is at least (2 — e) times larger than the 
optimal solution to the SDP. They also suggested the following strengthening of SDP (JTJ) and left 
its integrality gap as an open question: 

Min E te v( 1 + v ov l )/2 
s.t. (Vj - v ) • (Vj - v ) = 
(Vj - vjfc) • (Vj- - vfe) > 
l|Vi|| = 1 

Charikar answered this question by showing that the same graph G t but a different vector 
solution satisfies SDP 1 and gives rise to an integrality gap of 2 — o(l) as before. The following 

1 To be more precise, Charikar's proof was for a slightly weaker formulation than but it is not hard to see that 
the same construction works for SDP J5J as well. 
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V i € {0} U V. 
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is an equivalent formulation to SDP ©: 
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V i,j,k G {0} U V 

V i G {0} U V 
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Viewing SDPs as relaxations over l\ The above reformulation reveals a connection to metric 
spaces. The second constraint in SDP (j3J) says that || • || 2 induces a metric on {vj : i G {0} U V^}, 
while the first says that vo is on the shortest path between the images of every two neighbours. This 
suggests a more careful study of the problem from the metric viewpoint which is the purpose of this 
article. Such connections are also important in the context of the Sparsest Cut problem, where 
the natural SDP relaxation was analyzed in the breakthrough work of Arora, Rao and Vazirani 
[3] and it was shown that its integrality gap is at most 0{\f\ogn). This later gave rise to some 
significant progress in the theory of metric spaces 0] . 

For a metric space (X, d) , let c\ (X, d) denote the minimum distortion required to embed (X, d) 
into l\ (see |2U| for the related definitions). So c\(X, d) = 1 if and only if (X, d) can be embedded 
isometrically into i\. Consider a vertex cover S and its corresponding solution to SDP (|2"|). i.e., 
Vj = 1 for every i G S U {0} and Vj = —1 for every i G" S. The metric defined by || • || 2 on this 
solution (i.e., d(i,j) = ||vj — Vj|| 2 ) is isometrically embeddable into t\. Thus we can strengthen SDP 
(J2J) by allowing any arbitrary list of valid inequalities in l\ to be added. For example the triangle 
inequality, is one type of such constraints. The next natural inequality of this sort is the pentagonal 
inequality: A metric space (X, d) is said to satisfy the pentagonal inequality if for S, T C X of sizes 
2 and 3 respectively it holds that EieSj'eT > Yli j^s^iiij) + Y^i jer^i^j)- Note that this 

inequality does no longer apply to every metric, but it does to ones that are i\ embeddable. This 
leads to the following natural strengthening of SDP ©: 

V ij G E 
V S, T C {0} U V, (4) 
\S\ = 2,\T\ = 3 

V i 6 {0} U V 

In Theorem 03 we prove that SDP Q has an integrality gap of 2 — e, for every e > 0. It is 
interesting to note that for the classical problem of Sparsest Cut, it is not known how to show 
a nonconstant integrality gap against pentagonal (or any other A;-gonal) inequalities, although 
recently a nonconstant integrality gap was shown by Khot and Vishnoi and later by Devanur et al. 
[T71 |HI in the presence of the triangle inequalities 2 . 

One can actually impose any £i-constraint not only for the metric defined by {vj : i G FU{0}}, 
but also for the one that comes from {vj : i G yu{0}}U{— Vj : i G yu{0}}. This fact is used in fI3] 
where the triangle inequality constraints on this extended set are added, achieving an integraility 
gap of at most 2 — 0(-^===). It is also asked whether the integrality gap of this strengthening 
breaks the "2 — o(l) barrier". In Section [4.21 we answer this question in the negative. 

As Khot and Vishnoi note, and leave as an open problem, it is possible that their example satisfies some or all 
fc-gonal inequalities. 
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Integrality gap with respect to t\ embeddability At the extreme, strengthening the SDP 
with ^i-valid constraints, would imply the condition that the metric defined by || • || on {vj : i G 
{0}UV}, namely d(i,j) = ||v, — Vj|| 2 is l\ embeddable. Doing so leads to the following intractable 
program (which we refer to as SDP for convenience): 

Min E lG yl" l|v - Vi|| 2 /4 

s.t. ||vj - v || 2 + ||Vj - v || 2 = ||vj - Vj|| 2 V ij G E 

||vi|| = l ViG{0}UF lJ 
ci({v i: iG{0}UF},||-|| 2 ) = l 

In it is shown that an SDP formulation of Minimum Multicut, even with the constraint 
that the || • || 2 distance over the variables is isometrically embeddable into t\, still has a large 
integrality gap. For the Max Cut problem, which is more intimately related to our problem, it is 
easy to see that the t\ embeddability condition does not prevent the integrality gap of 8/9; it is 
therefore tempting to believe that there is a large integrality gap for SDP © as well. Surprisingly, 
SDP © has no gap at all; in other words, as we show in Theorem[2l the answer to SDP Q is exactly 
the size of the minimum vertex cover. A consequence of this fact is that any feasible solution to 
SDP (J2J that surpasses the minimum vertex cover induces an £| distance which is not isometrically 
embeddable into l\. This includes the integrality gap constructions of Kleinberg and Goemans', 
and that of Charikar's for SDPs (J2J) and (jHJ) respectively. The construction of Charikar is more 
interesting in the sense that the obtained £ 2 distance is a metric (from now on we refer to it as a 
negative type metric; see [Hj for background and nomenclature). In contrast to Theorem |2 we show 
in Theorem El that if we relax the last constraint in SDP Q to ci({vj : % G {0} U V}, || • || 2 ) < 1 + 6 
for any constant 6 > 0, then the integrality gap may "jump" to 2 — o(l). Compare this with a 
problem such as Sparsest Cut in which an addition of such a constraint immediately implies 
integrality gap at most 1 + 6. 

Negative type metrics that are not i\ embeddable Inspired by the above results, we con- 
struct in Theorem a simple negative type metric space (X, \\ • || 2 ) that does not embed well into 
t\. Specifically, we get c\(X) > I — e for every e > 0. In order to show this we prove a new isoperi- 
metric inequality for the hypercube Q n = {—1, l} n , which we believe is of independent interest. 
This theorem generalizes the standard one, and under certain conditions provides better guarantee 
for edge expansion: 

Theorem 1 (Generalized Isoperimetric inequality) For every set S C Q n; 

\E(S,S C )\ > \S\(n-log 2 \S\)+p(S), 
where p(S) denotes the number of vertices u G S such that — u G S. 

Khot and Vishnoi |17j constructed an example of an n-point negative type metric that for every 
6 > requires distortion at least (log log n) 1 / 6 "" 5 to embed into t\. Krauthgamer and Rabani |19j 
showed that in fact Khot and Vishnoi's example requires a distortion of at least f2(loglogn). Later 
Devanur et al. 8 showed an example which suffers an J7(loglogn) distortion even on average 
when embedded into t\ (we note that our example is also "bad" on average). Although the above 
examples require nonconstant distortion to embed into £\, we believe that Theorem is interesting 
for the following reasons: (i) Khot and Vishnoi's example is quite complicated, and there is no good 
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explanation to the fact that triangle inequality holds (citing the authors "this is where the magic 
happens"). Simple constructions such as the one we obtain may give a better understanding of the 
problem and lead to simpler constructions of negative type metrics that behave poorly in the above 
sense (ii) there are not many known examples of negative type metrics that require a constant c > 1 
distortion to embed into £±, and finding such examples is challenging and desirable. In fact before 
Khot and Vishnoi's result, the best known lower bounds (see [IZj) were due to Vempala, 10/9 for a 
metric obtained by a computer search, and Goemans, 1.024 for a metric based on the Leech Lattice 
(compare these to the 8/7 — e bound of Theorem EJ). We mention that by [I] every negative type 
metric embeds into i\ with distortion 0(\/log n log log n) . 

2 Preliminaries and notation 

A vertex cover of a graph G is a set of vertices that touch all edges. An independent set in G is 
a set I Q V such that no edge e G E joins two vertices in /. We denote by a{G) the size of the 
maximum independent set of G. Vectors are always denoted in bold font (such as v, w, etc.); ||v|| 
stands for the Euclidean norm of v, u • v for the inner product of u and v, and u ® v for their 
tensor product. Specifically, if v, u G M n , ugvis the vector with coordinates indexed by ordered 
pairs G [n] 2 that assumes value UjVj on coordinate Similarly, the tensor product of 

more than two vectors is defined. It is easy to see that (u ® v).(u' (g) V) = (u • u')(v • v'). For two 
vectors u G R n and v G M m , denote by (u, v) G M n+m the vector whose projection to the first n 
coordinates is u and to the last m coordinates is v. 

Next, we give a few basic definitions and facts about finite metric spaces. A metric space 
(X, dx) embeds with distortion at most D into (Y, dy) if there exists a mapping <j> : X Y so that 
for all a,b G X 7 • dx(a,b) < dy (<fi(a) , <fr(b)) < 7-D • dx(a,b), for some 7 > 0. We say that (X,d) 
is t\ embeddable if it can be embedded with distortion 1 into R m equipped with the l\ norm. An 
£2 distance on A is a distance function for which there there are vectors \r x G M m for every x G X 
so that d(x,y) = \\v x — v y || 2 . If, in addition, d satisfies triangle inequality, we say that d is an 
l\ metric or negative type metric. It is well known [2] that every l\ embeddable metric is also a 
negative type metric. 

3 i\ and Integrality Gap of SDPs for Vertex Cover — an "all or 
nothing" phenomenon 

It is well known that for the Sparsest Cut problem, there is a tight connection between t\ 
embeddability and integrality gap. In fact the integrality gap is bounded above by the least l\ 
distortion of the SDP solution. At the other extreme stand problems like Max Cut and Multi 
Cut, where £\ embeddability does not provide any strong evidence for small integrality gap. In this 
section we show that Vertex Cover falls somewhere between these two classes of ^i-integrality 
gap relationship, and it witnesses a sharp transition in integrality gap in the following sense: while i\ 
embeddability prevents any integrality gap, allowing a small distortion, say 1.001 does not prevent 
integrality gap of 2 — o(l)! 

Theorem 2 For a graph G = (V,E), the answer to the SDP formulated in SDP |2J) is the size of 
the minimum vertex cover of G. 
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Proof. Let d be the metric solution of SDP We know that d is the result of an £?, unit 
representation (i.e., it comes from square norms between unit vectors), and furthermore it is t\ 
embeddable. By a well known fact about i\ embeddable metrics (see, eg, [S]) we can assume that 
there exist At > and ft : {0} U7-> {—1, 1}, t = 1, . . . , m, such that 

m 

l|vi-v J || 2 = ^A t |/ t (i)-/ t (i)|, (6) 
t=i 

for every i,j G {0} U V. Without loss of generality, we can assume that /t(0) = 1 for every t. For 
convenience, we switch to talk about Independent Set and its relaxation, which is the same as 
SDP © except for the objective function that becomes Max^ ieV || v — Vj|| 2 /4. Obviously, the 
theorem follows from showing that this is an exact relaxation. 

We argue that (i) It = {i G V : ft(i) = — 1} is a (nonempty) independent set for every t, and 
(ii) ^2 \t = 2. Assuming these two statements we get 

^||vi-v || 2 ^ XXi A t |l - ft(i)\ A A t |/ t | 

> = > — — = > < max \I t \ < a(G), 

iav i&v t=i 1 1 

and so the relaxation is exact and we are done. 

We now prove the two statements. The first is rather straightforward: For i,j £ It, © implies 
that d(i,0) + d(0, j) > d(i,j). It follows that ij cannot be an edge else it would violate the first 
condition of the SDP. (We may assume that It is nonempty since otherwise the ft(-) terms have no 
contribution in <[B)l.) The second statement is more surprising and uses the fact that the solution is 
optimal. The falsity of such a statement for the problem of Max Cut (say) explains the different 
behaviour of the latter problem with respect to integrality gaps of l\ embeddable solutions. We 
now describe the proof. 

Let v[ = (y / Ai/2/i(i), . . . , y 7 \ m /2f m (i), 0). From © we conclude that ||v- — v^|| 2 = ||vj — v,-|| 2 , 
hence there exists a vector w = (wi, W2, to m+ i) G M m+1 and an orthogonal transformation T, 
such that 

Vi = T {V t + w) . 

Since the constraints and the objective function of the SDP are invariant under orthogonal trans- 
formations, without loss of generality we may assume that 

Vi = V- + w, 

for i G V U {0}. We know that 

m 

1 = INI 2 = |K + w|| 2 = w 2 m+1 + £(VV2/ t (i) + w t ) 2 . (7) 

t=i 

Since ||v^|| 2 = ||vq|| 2 = Ylt+i ^t/%> f° r every i G VU {0}, from (J7| we get Vq • w = v^ • w. Summing 
this over alH G V, we have 

m 

\V\ (v' • w) = < ■ w = Ed^l - 2|/t|)yV2^, 
iev *=i 
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or 

m m 

\V\y^/2w t = J2i\V\ ~ 2\I t \)y^/2w t , 
t=l i=l 

and therefore 

m 

Y,\h\Vk/2w t = 0. (8) 

Now © and © imply that 

771 ffi / \ I r I \ In 

max \I t \ > >/V2/t(0) + w t f\I t \ = £ ( M*L + v$\I t \ ) > £ 

* 6M t=l H V 7 t=l 



t=l 



(9) 



As we have observed before 

m \ I r I II || 2 

E A t| J f I _ ll v i — v 0|| 
2 ~~ ^ 4 

t=l i£V 

II II 2 

which means (as clearly Yli^v 4 — a (G)) that the inequalities in © must be tight. Now, 
since \It\ > we get that w = and from (jJJ) we get the second statement, i.e., = 2. This 

concludes the proof. ■ 
Now let us replace the last constraint in SDP (JSJ), ci({vj : i S {0} U V}, || • || 2 ) = 1, with a 
weaker condition ci({vj : i S {0} U V}, || • || 2 ) < 1 + 5, for arbitrary 5 > 0. 

Min E te v 1 - ll v o-v 4 || 2 /4 

s.t. ||vj — vo II 2 + || v j — v o|| 2 = || v i — v jl| 2 V ij £ E 

||vi|| = l Vie{0}UF 
Cl ({v 4 : i E {0}uy},|| • || 2 ) < 1 + 5 

Theorem 3 For every e > 0, there is a graph G for which ^[g) — 2 — e, where sd(G) is i/ie solution 
to the above SDP 



For the proof we show that the negative type metric implied by Charikar's solution (after 
adjusting the parameters appropriately) requires distortion of at most 1 + 5. We postpone the 
proof to the appendix. 



4 Integrality Gap against the stronger Semi Definite formulations 

In this section we discuss the integrality gap for stronger semi-definite formulations of vertex cover. 
In particular we show that Charikar's construction satisfies both SDPs (|1U|) and (|4"|). We start by 
describing this construction. 



4.1 Charikar's construction 

The graphs used in the construction are the so called Hamming graphs. These are graphs with 
vertices {—1, 1}™ and two vertices are adjacent if their Hamming distance is exactly an even integer 
d = jn. A result of Frankl and Rodl [11] shows that vc(G) > 2 n — (2 — S) n , where 5 > is a 
constant depending only on 7. Kleinberg and Goemans ^Hj showed that by choosing proper n and 
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7, this graph gives an integrality gap of 2 — e for SDP lfl|). Charikar (Hj showed that in fact G 
implies the same result for the SDP formulation in © too. To this end he introduced the following 
solution to SDP ©: 

For every Uj £ {— 1, l} n , define = Ui/^/n, so that • u'^ = 1. Let A = 1 — 27, q(x) = 
x 2t + 2iA 2 '~ 1 x and define y = (0, . . . , 0, 1), and 



where (5 will be determined later. Note that yj is normalized to satisfy ||yj|| = 1. 

Moreover yj is defined so that yj • y^ takes its minimum value when ij £ E, i.e., when • u^- = 
—A. As is shown in .6], for every e > we may set t = S7(i),/3 = 0(l/i),7 = it to get that 
(yo - Yi) • (yo - yj) = for ij e E, while (y - yj) • (y - yj) > always. 

Now we verify that all the triangle inequalities, i.e., the second constraint of SDP (J2J) are 
satisfied: First note that since every coordinate takes only two different values for the vectors 
in {yj : i 6 V}, it is easy to see that ci({yj : i G V}, \\ • || 2 ) = 1. So the triangle inequality 
holds when i,j,k £ V. When i = or j = 0, the inequality is trivial, and it only remains to 
verify the case that k = 0, i.e., (yo — yj) • (yo — yj) > 0, which was already mentioned above. 

Now EievC 1 + yo • y«)/2 = ^ir ' 1^1 = (3 + °( e )) \ v l where b y the result of Fr ankl and Rodl 
vc(G) = (l-o(l))|F|. 

4.2 Karakostas' and Pentagonal SDP formulations 

Karakostas suggests the following SDP relaxation, that is the result of adding to SDP (j3J) the 
triangle inequalities applied to the set {vj : i £ V U {0}} U {— Vj : i £ V U {0}}. 



We prove that this variant has integrality gap 2 — o(l) by showing that Charikar 's construction 
satisfies SDP (flO)) . We postpone the proof to the appendix. 

Theorem 4 The integrality gap of SDP U(J\) is bigger than 2 — e, for any e > 0. 

By now we know that taking all the l\ constraints leads to an exact relaxation, but clearly one 
that is not tractable. Our goal here is to explore the possibility that stepping towards i\ embed- 
dability while still maintaining computational feasibility would considerably reduce the integrality 
gap. A canonical set of valid inequalities for l\ metrics is the so called Hypermetric inequalities. 
Metrics that satisfy all these inequalities are called hypermetrics. Again, taking all these constraints 
is not feasible, and yet we do not know whether this may lead to a better integrality gap (notice 
that we do not know that Theorem |2] remains true if we replace the l\ embeddability constraints 
with a hypermetricity constraint). See [S] for a related discussion about hypermetrics. We instead 




Min Ei e y(l + v Vj)/2 
s.t. (Vj - v ) • (Vj - v Q ) = 



V ij eE 



(vj - v fe ) • (vj - v*.) > 
(vj + v fe ) • (vj - vj;) > 
(vj + v fc ) • (Vj + Vfc) > 
INI = 1 



Vi,j,kev 

V i,j,k £ V 

V i,j,k £ V 

V i £ {0} U V. 



(10) 
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consider the effect of adding a small number of such constraints. The simplest hypermetric in- 
equalities beside triangle inequalities are the pentagonal inequalities. These inequalities consider 
two sets of points in the space of size 2 and 3, and require that the sum of the distances between 
points in different sets is at least the sum of the distances within sets. Formally, let S,T C X, 

\ s \ = 2 > l T l = 3 > tnen we nave tne inequality EieSjeT ^ X)»j 6 s d (*>j) + J2ijeT d (^j)- To 

appreciate this inequality it is useful to describe where it fails. Consider the graph metric of if 2,3 • 
Here, the LHS of the inequality is 6 and the RHS is 8, hence i?2,3 violates the pentagonal inequality. 
In the following theorem we show that this "next level" strengthening past the triangle inequalities 
fails to reduce the integrality gap significantly. 

Theorem 5 The integrality gap of SDP ^ is at least 2 — e for any e > 0. 

We give here an outline of the proof (the complete proof appears in the appendix). 

We resort again to Charikar's construction. Recall that by ignoring yo the metric space defined 
by d(i,j) = \\yi — yj\\ 2 is t\ embeddable. Therefore, the only ^i-valid inequalities that may be 
violated are ones containing yo. Hence, we wish to consider a pentagonal inequality containing yo 
and four other vectors, denoted by yi, y2, y3, Y4- Assume first that the partition of the five points in 
the inequality puts yo together with two other points; then, using the fact that d(0, 1) = d(0, 2) = 
d(0, 3) = d(0, 4) and triangle inequality we get that such an inequality must hold. It therefore 
remains to consider a partition of the form ({yi , y2, ys}, {y4, yo})i i n other words we need to show 
that: 

d(l, 2) + d(l, 3) + d(2, 3) + d(0, 4) < d(l, 4) + d(2, 4) + d(3, 4) + d(0, 1) + d(0, 2) + d(0, 3) 

Let q(x) = x 2t + 2t\ 2t ~ 1 x. Recall that every yj is associated with a { — l,l} n unit vector Uj 
and its scaled counterpart u'^. After substituting each y, as a function of u^, the inequality gets 
the form 

E = • u' 2 ) + q(u[ • u' 3 ) + q(u' 2 ■ u 3 ) - • u' 4 ) - q(u' 2 • u' 4 ) - q(u' 3 • u 4 ) > -2g(l)/(l + 0) (11) 

The rest of the proof analyzes the minima of the function E and ensures that ([lift is satisfied 
at those minima. We proceed by first partitioning the coordinates of the original hypercube into 
four sets according to the sign of ui,U2 and 113 on these coordinates. We let Pq be the set of 
coordinates in which all three vectors assume negative value, and Pi(P2,Pz) be the coordinates 
on which 111(112,113) is positive and the other two vectors negative. Without loss of generality the 
union of these four sets is the set of all coordinates. Next, U4 is considered. Using the convexity 
of the polynomial q we show that we may assume that 114 is either all 1 or all —1 on each set Pj. 
Stronger properties of q ensure that U4 is —1 on the Pq coordinates. 

The cases left to check now are characterized by whether U4 is 1 or — 1 on each of Pi , P2 , P3 ■ By 
symmetry, all we need to know is the number of blocks Pj on which U4 takes the value 1. Hence we 
are left with four cases and we use calculus arguments to analyze each case separately. Our analysis 
shows that in all cases the function E is minimized when U4 identifies with one of ui , 112 , U3 ; but 
then it can be easily seen that the pentagonal inequality reduces to a triangle inequality which we 
know is valid. 
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5 Lower bound for embedding negative type metrics into l\ 



While, in view of Theorem Charikar's metric does not supply an example that is far from £\, 
we may still (partly motivated by Theorem |2J) utilize the idea of "tensoring the cube" and then 
adding some more points in order to achieve negative type metrics that are not t\ embeddable. 
Our starting point is an isoperimetric inequality on the cube that generalizes the standard one, and 
under certain conditions provides better edge expansion guarantee. Such a setting is also relevant 
in |17l I19j where harmonic analysis tools are used to bound expansion; there tools are unlikely to 
be applicable to our case where the interest and improvements lie in the constants. 

Theorem 1 (Generalized Isoperimetric inequality) For every set S C Q n; 

\E(S,S C )\ > \S\(n-log 2 \S\)+p(S). 

where p(S) denotes the number of vertices u G S such that — u G S. 

Proof. We use induction on n. Divide Q n into two sets V\ = {u : ui = 1} and V—\ = {u : ui = 
-1}. Let Si = S n V\ and S_i = 5n VLi. Now, E(S, S c ) is the disjoint union of E(S U Vy \ Si), 
E(S-i, V-i \ S-i), and E(S±, V-\ \ S-i) U E(S-±, V\ \ Si). Define the operator^ on Q n to be the 
projection onto the last n — 1 coordinates, so for example Si = {u G Q n -\ ■ (1> u ) £ Si}. It is easy 
to observe that 

\E(Si,V-i\S-i)UE(S-i,Vi\Si)\ = \SiAS-i\, 

We now argue that 

P (S) + |5i| - < P (sl)+p(sZi) + I^AS^I. (12) 

To prove (fT2*)l . for every u G {—1, l} n_1 , we show that the contribution of (1, u), (1, — u), (—1, u), 
and (— 1,— u) to the right hand side of (|12|) is at least as large as their contribution to the left 
hand side: This is trivial if the contribution of these four vectors to p(S) is not more than their 
contribution to p(Si), and p(S-i). We therefore assume that the contribution of the four vectors 
to p(S), p(Si), and p(S-i) are 2, 0, and 0, respectively. Then without loss of generality we may 
assume that (1, u), (—1, — u) G S and (1, — u), (—1, u) S, and in this case the contribution to both 
sides is 2. 

By induction hypothesis and (|12|) we get 

\E(S,S C )\ = \E&,Q n „i\K\ + \E(sZi,Q n _i\sZi\ + \S\ASZi\ 

> \Si\(n - 1 - log 2 \Si\)+p(Si) + |SLi|(n - 1 - log 2 |5_i|) + p(sZ~i) + |^A&^| 

> |5|n - \S\ - (\Si\ log 2 \Si\ + log 2 \S-x\)+p(Sl) +p(sZi) + \S\ASZ\ 

> \S\n - (2|5_i| + |5i| log 2 |5i| + |S_i| log 2 l^l) +p(S). 

Now the lemma follows from the fact that 2|5_i| + log 2 \Si\ + IS-il log 2 | -S' 1 1 < \S\ log 2 

which can be obtained using easy calculus. ■ 
We call a set S C Q n symmetric if — u G S whenever u G S. Note that p(S) = \S\ for symmetric 
sets S. 

Corollary 1 For every symmetric set S C Q n 

\E(S,S C )\ >|5|(n-log 2 |5| + l). 
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The corollary above implies the following Poincare inequality. 

Proposition 1 (Poincare inequality for the cube and an additional point) Let f : Q n U {0} — > M. m 
satisfy that /(u) = /(— u) for every u G Q n . Then the following Poincare inequality holds. 

l.|(4a + 1/2) Y, ll/(u)-/(v)||i<« E \\m-f(y)\\l + \ £ ll/(«)-/(0)lli ( 13 ) 

u,vgQ„ uv6_B ueQ n 

Proof. It is well known that instead of considering / : V — > £±, it is enough to prove the above 
inequality for / : V — > {0,1}. Further, we may assume without loss of generality that /(0) = 0. 
Associating S with {u : /(u) = 1}, Inequality (|T3|) reduces to 

i-^(4a + l/2)|S||S c | <a\E(S,S c )\ + \S\/2, (14) 

where 5 is a symmetric set, owing to the condition /(u) = /(— u). From the isoperimetric inequality 
of Theorem n we have that \E(S, S c )\ > \S\(x + 1) for x = n — log 2 \S\ and so 

^ a(x + l) + l/2 ^ ^| 5 || 5C)| < a \ E{S ,S c )\ + |5|/2. 

It can be verified (See LemmaEJ that ^^^-x^ 2 attains its minimum in [l,oo) at x = 3 whence 
a ^ X i^2-^ 2 — 4 "7/g //2 ) an d Inequality (jTl)) is proven. ■ 

Theorem 6 Let V = {u : u G Q n } U {0} ; where u = u <g> u. T/ien /or i/ie semi-metric space 
X = (V, || • || 2 ) we have c\{X) > 1 — e, for every e > and sufficiently large n. 

Proof. We start with an informal description of the proof. The heart of the argument is showing 
that the cuts that participate in a supposedly good l\ embedding of X cannot be balanced on one 
hand, and cannot be imbalanced on the other. First notice that the average distance in X is almost 
double that of the distance between and any other point (achieving this in a cube structure 
without violating the triangle inequality was where the tensor operation came in handy). For a 
cut metric on the points of X, such a relation only occurs for very imbalanced cuts; hence the 
representation of balanced cuts in a low distortion embedding cannot be large. On the other hand, 
comparing the (overall) average distance to the average distance between neighbouring points in 
the cube shows that any good embedding must use cuts with very small edge expansion, and such 
cuts in the cube must be balanced (the same argument says that one must use the dimension cuts 
when embedding the hamming cube into i\ with low distortion). The fact that only symmetric cuts 
participate in the l\ embedding (or else the distortion becomes infinite due to the tensor operation) 
enables us to use the stronger isoperimetric inequality which leads to the current lower bound. We 
proceed to the proof itself. 

We may view X as a distance function with points in u G Q n U {0}, and d(u, v) = ||u — v|| 2 . 
We first notice that X is indeed a metric space, i.e., that triangle inequalities are satisfied: notice 
that X \ {0} is a subset of {—1, l} n . Therefore, the square Euclidean distances is the same 
(upto a constant) as their i\ distance. Hence, the only triangle inequality we need to check is 
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1 1 u — v|| 2 < ||u — 0|| 2 + || v — 0|| 2 , which is implied by the fact that u • v = (u • v) 2 is always 
nonnegative. 

For every u, v £ Q n , we have d(u, 0) = ||u|| 2 = u • u = (u • u) 2 = n 2 , and d(u, v) = ||u — v|| 2 = 
1 1 xi 1 1 2 + ||v|| 2 — 2(u- v) = 2n 2 — 2(u- v) 2 . In particular, if uv G E we have d(u, v) = 2n 2 — 2(n — 2) 2 = 
8(n — 1). We next notice that 

d(u, v) = 2 2n x 2n 2 - 2 ^(u • v) 2 = 2 2n x 2n 2 - 2 u, Vi ) 2 = 2 2n (2n 2 - 2n), 

u,ve<2„ U,V U.V i 

as X^u v u i v i u j v j is 2 2n when i = j, and otherwise. 

Let / be a nonexpanding embedding of X into t\. Notice that 

d(u, -u) = 2n 2 - 2(u • v) 2 = 0, 

and so any embedding with finite distortion must satisfy /(u) = /(— u). Therefore Inequality (jl3j) 
can be used and we get that 

ttSuv^ ll/(u) ~ mh + 1 E ueQn 117(a) - /(Q)lli . 8,, , 1/9 . hK . 

j-= — — — > -(4a + 1/2). (15) 

2^E u ,veQjl/( u )-/( v )lli 7 

On the other hand, 

a Euve E d ( u > v ) + 5 SueQ„ d ( u ' °) _ 8a(n 2 - n) + n 2 



^E Ui veq„%v) 2n 2 -2n 



4q + l/2 + o(l). (16) 



The discrepancy between (|15() and (|16() shows that for every e > and for sufficiently large n, 
the required distortion of V into t\ is at least 8/7 — e. ■ 



6 Conclusion 

We have considered the metric characterization of SDP relaxations of Vertex Cover and specif- 
ically related the amount of a l\ information" that is enforced with the resulting integrality gap. 
We showed that a 2 — o(l) integrality gap survives in the feasible extreme of this range, while no 
integrality gap exists in the most powerful (and not feasible) extreme, i.e., when l\ embeddability of 
the solution is enforced. We further demonstrated that integrality gap is not a continuous function 
of the possible distortion that is allowed, as it jumps from 1 to 2 — o(l) when the allowed distortion 
changes from 1 to 1 + 5. These results motivated us to find a negative type metric that does not 
embed well to £%, which is a fairly elusive object. The natural extensions of these results are to (i) 
check whether the addition of more /c-gonal inequalities (something that can be done efficiently for 
any finite number of such inequalities) can reduce the integrality gap or prove otherwise. We in fact 
conjecture that the integrality gap is still 2 — o(l) when we impose the condition that the solution 
is a Hypermetric. It is interesting to note that related questions are discussed in the context of LP 
relaxations of Vertex Cover in [3] (ii) use the nonembeddability construction and technique in 
Section to find negative type metrics that incur more significant distortion when embedded into 
t\. It is interesting to investigate whether (and how) our findings are connected to the question 
of the power of Lift and Project methods; specifically the one that is defined with the Positive 
Semi Definiteness constraints, also known as LS + (see [2] for relevant discussion). Notice that k 
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rounds of LS + will imply all fc-gonal inequalities, but may be much stronger. In fact, we do not 
even know whether applying two rounds of LS + does not lead to an integrality gap of 2 — 0(1) . 
Last, we suggest looking at connections of ^i-embeddability and integrality gaps for other NP-hard 
problems. Under certain circumstances, such connections may be used to convert hardness results 
of combinatorial problems into hardness results of approximating i\ distortion. 
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7 Appendix 

7.1 Proof of Theorem El 

Let Yi and u'j be defined as in Section 14.11 To prove Theorem El it is sufficient to prove that 
c i({y« : * ^ {0} U V}, || • || 2 ) = 1 + o(l). Note that every coordinate of yi for all i £ V takes at most 
two different values. It is easy to see that this implies ci({yt : i E V}, \\ ■ || 2 ) = 1. In fact 



1-/3 2 



\ 



. . . ® n't, 
2t times 



2U 2 *-V. 



ii 



(17) 
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is an isometry from ({yj : i £ V}, \\ ■ || 2 ) to l\. For i £ V, we have 

ll/Wlli = ( - t x ^ + -^=2£A 2 ^-L + o) = i-f x (2 + 4a 2 *- 1 ) (18) 

Since (3 = recalling that A = 1— i, it is easy to see that for every i € V, lim^oo ||/(yi)||i = 2. 

On the other hand for every i £ V 

lim || yi - y || 2 = lim 2 - 2( y< • y ) = lim 2 - 2(3 = 2. 

t—fOO t^oo t— >oo 

So if we extend / to {yj : i £ V U {0}} by defining /(yo) = 0, we obtain a mapping from 
({y« : i £ V U {0}}, || • || 2 ) to i\ whose distortion tends to 1 as £ goes to infinity 

7.2 Proof of Theorem |U 

We show that the Charikar's construction satisfies formulation (|10|) . By [£j and from the discussion 
in Section 14.11 it follows that all edge constraints and triangle inequalities of the original points 
hold. Hence we need only consider triangle inequalities with at least one nonoriginal point. By 
homogeneity, we may assume that there is exactly one such point. 

Since all coordinates of yj for i > assume only two values with the same absolute value, it 
is clear that not only does the metric they induce is t\ but also taking ±y, for % > gives an l\ 
metric; in particular all triangle inequalities that involve these vectors are satisfied. In fact, we 
may fix our attention to triangles in which ±yo is the middle point. This is since 

(±y* - ±yj) • (yo - ±y?) = (±yj - yo) • (Ty» - yo)- 

Consequently, and using symmetry, we are left with checking the nonnegativity of (yj + yo) • 
(yj + yo) and (-yj - y ) • (y^ - y ). 

(y< + yo) • (yj + yo) = l + yo • (y< + yj) + yi ■ y 3 > l + W + p 2 - (1 - (3 2 ) = 2/3(1 + (3)>o. 

Finally, 

(-y* - yo) • (yj - yo) = l + yo • (y* - yj) - y« • yj = i - y» • yj > o 

as yi,yj are of norm 1. 

7.3 Proof of Theorem [U 

Again we show that the metric space used in Charikar's construction satisfies the pentagonal in- 
equalities. As explained in the outline of the proof in Section 14.21 we need to consider only pen- 
tagonal inequalities in which the partition of the vectors is of the form ({yi, y2, y3j-, {y4, yo})- 
Therefore we need to show that: 

d(l, 2) + d(l, 3) + d(2, 3) + d(0, 4) < d(l, 4) + d(2, 4) + d(3, 4) + d(0, 1) + d(0, 2) + d(0, 3) 

As the vectors are of unit norm, it is clear that d(Q,i) = 2 — 2(3 for all i > and that d(i,j) = 
2—2yiyj. Recall that every yj is associated with a { — 1, 1} vector Uj and with its normalized multiple 
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u^. Also, it is simple to check that • yj = [5 2 + (1 — /3 2 )g(u'j ■ u'j)/q(l) where q(x) = x 2t + 2A 2i 1 x. 
After substituting this in the previous expression, it is easy to see that our goal is then to show: 

E = g(ui • u' 2 ) + g(ui • u' 3 ) + q(u' 2 • u' 3 ) - q(u[ • u' 4 ) - q(u' 2 • U 4 ) - q(u' 3 • u 4 ) > -2g(l)/(l + 0) (19) 

We partition the coordinates of the original hypercube into four sets according to the values 
assumed by Ui , u 2 and 113 . Assume without loss of generality that in any coordinate at most one 
of these get the value -1 (otherwise multiply the values of the coordinate by —1). We get four sets, 
Pq for the coordinates in which all three vectors assume value -1, and Pi, P2, P3 for the coordinates 
in which exactly ui,U2,U3 respectively assumes value 1. 

We now consider U4. We argue that without loss of generality we may assume that U4 is "pure" 
on each of the Pq, P\, P 2 , P3; in other words it is either all 1 or all —1 on each one of the them. 
Assume for sake of contradiction that there are w coordinates in Pq on which U4 assumes value 
— 1, and that < w < \Pq\- Let u^" (similarly uj) be identical to U4 except we replace one 1 in 
Po by —1 (replace one —1 in Pq by 1). We show that replacing U4 by U4" or by u 4 we decrease 
the expression E. This means that the original 114 could not have been a choice that minimized E 
and the claim follows. Let pi = u, • U4, pf = u' { • (u 4 )' and p^ = u' { • (u 4 )' for i = 1,2, 3. Notice 
that the above replacement only changes the negative terms in (|19j) so our goal now is to show that 

max{^q(p+),2^q{Pi )} > % —^ — — = 

i=i i=i 

q(pf) + qipJ) . (vt + Pi \ / n 

i=l i=l V ' i=l 

where the second last inequality is using the (strict) convexity of q. This of course applies to Pi, P2 
and P3 in precisely the same manner. The above characterization significantly limits the type of 
configurations we need to check but regretfully, there are still quite a lot of cases to check. 
For Pq , we can in fact say something stronger than we do for Pi , P 2 , P3 : 

Proposition 2 If there is a violating configuration, there is one with 114 that has all the Pq coor- 
dinates set to —1. 

This is not a surprising fact; in fact if q was a monotone increasing function this would be obvious, 
but of course the whole point behind q is that it brings to minimum some intermediate value (—A) 
and hence can not be increasing. The convexity of q is also not enough, and one should really utilize 
the exact properties of q. We postpone the proof till the end and continue our analysis assuming 
the proposition. 

The cases left to check now are characterized by whether U4 is 1 or —1 on each of Pi,P2,P3. 
By symmetry all we really need to know is 

£(u 4 ) = \{i : u 4 is 1 on P}| 

If £(u 4 ) = 1 it means that U4 is the same as one of ui,U2 or 114 hence the pentagonal inequality 
reduces to the triangle inequality, which we have already shown is valid. If £( u 4) = 3, it is easy to 
see that in this case u' x u 4 = u 2 u 3 , and likewise u 2 u 4 = u^Ug and u 3 u 4 = u^u^ hence E is for 
these cases, which means that the inequality 1191 is satisfied. 
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We are left with the cases £(114) E {0, 2}. 
Case 1: £(u 4 ) = 

Let x = = ~\P2\,z = -l-Pal- Notice that x + y + z = |(|Pi| + |P 2 | + |P 3 |) < 2, as these 

sets disjoint. Now, think of 

E = E{x, y, z) = q(l - (x + y)) + q(l - (x + z)) + g(l - (y + «)) - g(l - a;) - g(l - y) - g(l - z) 

as a function from M 3 to M, and we will show the (stronger than necessary) claim that E achieves 
its minimum in {(2, y, z) € R 3 : x + y + z < 2} at points where either x,y or z are zero. Assume 
without loss of generality that < x < y < z. 

We consider the function g(5) = E(x — S,y + S,z). It is easy to see that </(0) = q'(l — (x + 
z)) — q'(l — (y + z)) — q'(l — x) + q'(l — y). Our goal is to show that ^'(0) is nonpositive, and in 
fact that g'{5) < for every 5 S [0,x]. This, by the Mean Value Theorem implies that 



and in particular that in this case we may assume that x = 0. This means that yi = y4 which 
reduces to the triangle inequality on yo , y 2 ; Y3 • 

Note that in q'(l — (x + z)) — q'{l — (y + z)) — q'{l — x) + q'(l — y), the two arguments in the 
terms with positive sign have the same average as the arguments in the terms with negative sign, 
namely fi = 1 — (x + y + z)/2. We now have </(0) = q'(fi + b) — q'(fi + s) — q'(n — s) + q'(fi — b), 
where b = (x — y + z)/2, s = (— x + y + z)/2. 



Notice that = 1 — (x + y + z)/2 > 0. Further, since x < y, we get that s > b > 0. This means 
that ^'(0) < 0. It can be easily checked that the same argument holds if we replace x,y by x — 5 
and y + 5. Hence g'(5) < for every 5 G [0, x], and we are done. 
Case 2: £(u 4 ) = 2 

The expression for P is now: 

^(«, V, z) = 9(1 - (z + y)) + 9(1 + z)) + g(l - (y + z)) - g(l - x) - q(l - y) - q(l - (x + y + z)) 

Although E(x, y, z) is different than in Case 1, the important observation is that if we consider 
again the function g(§) = E(x — 5,y + 5, z) then the derivative g'{8) is the same as in Case 1 and 
hence the same analysis shows that E(Q,x + y,z) < E(x,y,z). Therefore we may assume that 
x = 0. This means that y 2 identifies with y4 and the inequality reduces to the triangle inequality 
on yo,yi,y3- 

It now remains to prove Proposition [21 

Proof of Proposition |5] : Fix a configuration for 111,112,113 and as before let x = —\Pi\, 
y = — I 1 ) -2 = ^ I-F3 1 , and w = ^\Pq\, where w > 0. Consider a vector 114 that has all — l's in 
Pq. Let Hi = -iT(iij,U4), where H(iij,U4) is the Hamming distance from 114 to Uj, i = 1,2,3. It 



E(0,x + y,z) < E(x,y,z) 



9(0) 




i even 
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suffices to show that replacing the Po _ P ar t of U4 with l's (which means adding w to each Hi) does 
not decrease the LHS of El i-e., that: 

9 (1 - Hi) + q(l - H 2 ) + q(l - H 3 ) > q(l - (Hi + w)) + 9(1 - (H 2 + w)) + q(l - (H 3 + w)) (20) 

Because of the convexity of 9 as explained before, the cases that we need to consider are 
characterized by whether U4 is 1 or — 1 on each of P± , P 2 , P 3 ■ By symmetry there are 4 cases to 
check, corresponding to the different values of £(114). In some of these cases, we use the following 
argument: consider the function g(5) = 9(1 — (Hi + 5)) + 9(1 — (H 2 + 5)) + q(l — (H 3 + 5)), where 
5 £ [0,uu\. Let di = 1 - {Hi + 5). The derivative g'(5) is: 

g'(5) = -(9>i) + q'(a 2 ) + q'(a 3 )) = -2t(af~ l + af~ l + af~ l + 3A 2 '- 1 ) 

If we show that the derivative is negative for any 5 £ [0, w], that would imply that g(0) > g(w) and 

hence we are done since we have a more violating configuration if we do not add w to the Hamming 

distances. 

Case 1: £(u 4 ) = 

In this case Hi = x, H 2 = y, H3 = z. Note that x + y + z + w = 2. Hence, if Hi > 1 for some 
i, say for Hi, then H 2 + 5 < 1 and ^3 + 8 < 1. This implies that a 2 > and 03 > 0. Thus 

g'(S) < -(-1 + 3A 2 *" 1 ) < 1 - 3/e < 

since A 2 *" 1 = (1 — ^j) 2 *" 1 >l/e. Hence we are done. 

Therefore, we can assume that Hi < 1 for all i, i.e., 1 — Hi > 0. We now compare the LHS and 
RHS of ()20j) . In particular we claim that each term 9(1 — Hi) is at least as big as the corresponding 
term 9(1 — (Hi + w)). This is because of the form of the function 9. Note that 9 is increasing in 
[0, 1] and also that the value of 9 at any point x E [0, 1] is greater than the value of 9 at any point 
y € [—1,0). Therefore since 1 — Hi > and since we only subtract w from each point, it follows 
that (HOJ) holds. 
Case 2: £(u 4 ) = 1 

Assume without loss of generality that U4 is 1 on Pi only. In this case, Hi = 0, H 2 = x + y 
and H 3 = x + z. The LHS of inequality (JliJ) is now: LHS = q(l) + 9(1 - (x + y)) + 9(1 - (x + z)), 
whereas the RHS is: 

RHS = 9(1 -w) + q(l-(x + y + w)) + 9(1 - (x + z + to)) = 9(1 - w) + q(-l + z) + + y) 

by using the fact that x + y + w = 2 — z. 

Let ai = 1, a 2 = 1 — (x + y), a 3 = 1 — (x + z). The LHS is the sum of the values of 9 at 
these points whereas the RHS is the sum of the values of 9 after shifting each point ai to the left 
by w. Let a\ = ati — w. The difference A = 9(1) — 9(1 — w) will always be positive since 9(1) 
is the highest value that 9 achieves in [—1,1]. Therefore to show that (J20[) holds it is enough to 
show that the potential gain in 9 from shifting a 2 and a 3 is at most A. Suppose not and consider 
such a configuration. This means that either q(a' 2 ) > q(a 2 ) or q(a' 3 ) > q(a 3 ) or both. We will 
consider the case that both points achieve a higher value after being shifted. The same arguments 
apply if we have only one point that improves its value after subtracting w. Hence we assume that 
9(0/2) > 9(02) and q(a' 3 ) > q(a 3 ). Before we proceed, we state some properties of the function 9, 
which can be verified by simple calculations: 
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Claim 1 The function q is decreasing in [— 1, —A] and increasing in [—A, 1]. Furthermore, for any 
2 points x,y such that x £ [—1,2 — 3A] and y > 2 — 3A, q(y) > q(x). 

Using the above claim, we can argue about the location of 02 and 03. If 02 > 2 — 3A > — A, 
then £7(0:2) > q{a' 2 ). Thus both 02 and 03 must belong to [—1,2 — 3A] = [—1, —1 + Jj]. We will 
restrict further the location of 02 and 03 by making some more observations about q. The interval 
[—1,2 — 3 A] is the union of A\ = [—1, — A] and A2 = [—A, 2 — 3A] and we know q is decreasing in A\ 
and increasing in A2. We claim that 02,03 should belong to A\ in the worst possible violation of 
(|2U|). To see this, suppose 02 £ A2 and 03 £ A2 (the case with 02 S ^2, «3 6 A\ can be handled 
similarly). We know that q is the sum of a linear function and the function x 2t . Hence when we 
shift the 3 points to the left, the difference q(l) — q(l — w) is at least as big as a positive term 
that is linear in w. This difference has to be counterbalanced by the differences q{oL 2 ) — (7(02) and 
q(a' 3 ) — (7(03). However the form of q ensures that there is a point (2 £ A\ such that (7(0:2) = <l{Qi) 
and ditto for 03. Hence by considering the configuration where 02 = C2 and 03 = C3 we w hl have 
the same contribution from the terms q(a 2 ) — (7(02) and q(a' 3 ) — (7(03) and at the same time a 
smaller w. 

Therefore we may assume that w < \A±\ = which is a very small number. By substituting 
the value of q, (|2U|) is equivalent to showing that: 

1 - (1 - w) 2t + Qt\ 2t ~ l w > (o 2 - w) 2t - af + (o 3 - w) 2t - af 

It is easy to see that the difference 1 — (1 — w) 2t is greater than or equal to the difference 
(«2 — w) 2t — «2*- Hence it suffices to show: 

etA 2 *- 1 ^ > (o 3 -w) 2t -af 

Since w is small, we estimate the difference (03 — w) 2t — a 2t using the first derivative of x 2t (the 
lower order terms are negligible). Thus the RHS of the above inequality is at most 2t\as\ 2t ^ 1 w, 
which is at most 2tw. But the LHS is: 

6tA 2 ' _1 ?i; > 6t/ew > 2tw 

Therefore no configuration in this case can violate (|2U|) and we are done. 
Case 3: £(u 4 ) = 2 

Assume that 114 is 1 on Pi and i"2- Now Hi = y, H2 = x, H3 = x + y + z. The LHS and RHS 
of (|20|) are now: 

LHS = q{l-y)+q{l-x)+q{l-{x + y + z)) 
RHS = q(l - (y + w)) + q(l - (x + w)) + q{-l) 

As in case 2, let a± = 1 — y, 02 = 1 — x and 03 = 1 — (x + y + z) be the 3 points before shifting by 
w. First note that either a± > or 02 > 0. This comes from the constraint that x + y + z + w = 2. 
Assume that a.\ > 0. Hence q(ai) — q(a\ — w) > 0. If 02 [—1,2 — 3A] then we would be done 
because by Claim ^(02) — 9(02 — w) > 0. Therefore the only way that (12(H) ca n be violated is 
if the nonlinear term (03 — w) 2t — af can compensate for the loss for the other terms. It can be 
easily checked that this cannot happen. Hence we may assume that both 02, 03 £ [— 1, 2 — 3A] and 
that (7(02 — w) > (7(02)5 (7(03 — w) > (7(03). The rest of the analysis is based on arguments similar 
to case 2 and we omit it from this version. 

Case 4: £(u4) = 3 This case can also be done using similar arguments with case 2 and 3. ■ 
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7.4 A technical lemma 

Lemma 1 The function f(x) = a ^ x ^}^^ 2 for a = 14 _ h g 2 ln 2 attains its minimum in [1, oo] at x = 3. 
Proof. The derivative of / is 

1 - 2" x - (a(x + 1) + 1/2) ln(2)2- ;E 
(1 - 2-^) 2 ' 

It is easy to see that /'(3) = 0, /(l) = 4a + 1 > 8/7, and lirriaj^oo f(x) = oo. So it is sufficient to 
show that 

g(x) = l-2~ x - (a(x + 1) + 1/2) ^(2)2^, 
is an increasing function in the interval [1, oo). To show this note that 

g'(x) = 2' x ln(2) (1 - a + ax ln(2) + a ln(2)) > 0, 

for x > 1. ■ 
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